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ENTERED 



ccg gaa atg 
Pro Glu Met 

gga aat aat 
Gly Asn Asn 



APPLICANT: Director-General of Agency of Industrial Science and Technology 
TITLE OF INVENTION: Heat-resistant enzyme having - glycosidase activity 
FILE REFERENCE: PH-67 9US 

CURRENT APPLICATION NUMBER: US/09/369 , 735 
CURRENT FILING DATE: 1999-08-06 
EARLIER APPLICATION NUMBER: JP 10-222866 
NUMBER OF SEQ ID NOS : 4 
SOFTWARE: Patentln Ver. 2.0 
SEQ ID NO 1 
LENGTH: 1269 
TYPE : DNA 

ORGANISM: Pyrococcus horikoshii 
FEATURE : 
NAME/KEY: CDS 
LOCATION: (1) . . (1269) 
SEQUENCE: 1 
atg ccg.ctg aaa ttc 
Met Pro Leu Lys Phe 
1 5 
tec cat cag ata gag 
Ser His Gin lie Glu 
20 

gag cag att gga aag 
Glu Gin lie Gly Lys 
35 

tgg gaa ctt tac agg 
Trp Glu Leu Tyr Arg 
50 

aat get tat agg ttc 
Asn Ala Tyr Arg Phe 
65 

aat aaa ttt aat gaa 
Asn Lys Phe Asn Glu 
85 

ttg tta ttg acg aga 
Leu Leu Leu Thr Arg 
100 

act age cct etc tgg 
Thr Ser Pro Leu Trp 
115 

aac eta aaa cat tgg 
Asn Leu Lys His Trp 
130 

gaa aaa gtt aaa eta 



etc 
Leu 

gat 
Asp 

tec 
Ser 
70 
gat 
Asp 



ccc tac 
Pro Tyr 
40 

gat att 
Asp lie 

55 
ata gag 
lie Glu 

get ttc 
Ala Phe 



ggt ata act 
Gly lie Thr 

ttc atg aag 
Phe Met Lys 
120 

gaa aag tac 
Glu Lys Tyr 

135 
gta get acc 



ttt 
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Arg 
25 
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105 
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He 
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48 


Leu 


Phe 


Gly Thr 


Ala 


Thr 


Ser 




10 










15 






tgg 


aat 


gat 


tgg 


tgg 


tac 


tat 


96 


Trp 


Asn 


Asp 


Trp 


Trp 
3 0 


Tyr 


Tyr 




tct 


ggt 


aag 


get 


tgc 


aat 


cac 


144 


Ser Gly Lys 


Ala 


Cys 


Asn 


His 










45 










eta 


atg 


acc 


age 


ttg 


ggc 


tat 


192 


Leu 


Met 


Thr 
60 


Ser 


Leu 


Gly 


Tyr 




age 


agg 


eta 


ttc 


cca 


gag 


gaa 


240 


Ser 


Arg 
75 


Leu 


Phe 


Pro 


Glu 


Glu 
80 




aaa 


tac 


egg 


gag 


att 


ata 


gac 


288 


Lys 


Tyr 


Arg 


Glu 


He 


He 


Asp 




90 










95 






ctg 


gtg 


acc 


eta 


cac 


cac 


ttt 


336 


Leu 


Val 


Thr 


Leu 


His 
110 


His 


Phe 




ggt 


ggc 


ttc 


ctt 


agg 


gag 


gag 


384 


Gly Gly Phe 


Leu 


Arg 


Glu 


Glu 










125 










gaa 


aag 


gtt 


get 


gag 


ctt 


tta 


432 


Glu 


Lys 


Val 
140 


Ala 


Glu 


Leu 


Leu 




aat 


gag 


ccg 


atg 


gta 


tac 


gta 


480 



OIPE 



PAGE: 2 RAW SEQUENCE LISTING DATE: 08/26/1999 

PATENT APPLICATION US/09/369,735 TIME: 15:33:15 

Input Set: 1369735. RAW 



45 


olU 


Lys 


Va 1 


Lys 


Leu 


Vol 


Al * 


I Hi. 


Sr A1C 


Asn 


Glu 


Pro 


Met 


Val 


TVr 
j 


Val 




46 












1 RO 

ljU 










155 










160 




47 


acg 


acg 




cat 


r*t a 


acg 


get 




4- rrn 

tyy 


ece 


cca * 


ttc 


att 


a rrcj 


agt 


cca 


528 


48 


Men 


Met 


uiy 


Tyr 


Leu 


1 III. 


Al a 

Ala. 


lyr 




Pro 


XT A. \J 


Phe 


lie 




Ser 


Pro 




49 










ICR 
IO 3 










170 










175 






50 


4_ 4- 4- 

ttc 


aag 


gec 


4-4-4- 


aag 


rrf- a 

yea 


get 


gca 


aac 


t ty 


ttt 


aaa 


err* t 
y 1 1 


eac 


gca 


att 


576 


51 


pne 


Lys 


Aia 


Fjne 


Lys 


val 


Ala 


TV I a 


* A en 
noli 


T .pi i 


T.pu 


Lys 


Ala 


His 


Ala 


lie 




52 








IPO 

lou 










"1 ft R 

X 0 J 










190 








53 


CfCC 




gaa 


r*t t- 


nt 1- 


/-1 -1 4- 
td U 


ggg 


aaa 
CLCLCl 


ttc 


aaa 


gtt 


yy d 


ate 


gta 


aag 


aat 


624 


54 




Tyr 


(jIU 


Leu 


Leu 


nlo 


Vjly 


liy s 


lr I1C 


T .VQ 

j_i y 0 


V d JL 




lie 


Val 


Lys 


Asn 




55 






1 qc 










^ u u 










205 










56 


_, 4- 1_ 
acc 


ccc 


did 


dLd 


etc 


eea 


gcg 


a rr 4- 


y dt 


aarr 
ddy 


rra rr 

y<iy 


a rrrr 
dyy 


gat 


aga 


aaa 


gee 


672 


57 


lie 


Pro 


Tl 

lie 


lie 


Leu 


Pro 


Aia 


i3CI 


Asp 


iiyb 


VJl u. 


Arg 




7\ yrr 

Airg 


Lys 


Ala 




58 




210 










Z ID 










99 0 
^ ^ \j 












59 


get 


gag 


aaa 


get 


rra f- 

gac 


a a 4- 

aac 


4- +- a 

t L.a 


t- f- i" 
t 1 1 


aac 


4- rrrr 

egg 


t dt 


ILL 


ttg 


gat 


y ty 


ata 


720 


60 


TV "I 

Ala 


Glu 


Lys 


Aia 


Asp 


Asn 


Leu 


rlic 


Asn 


irp 


xll 0 


It lie 


llC U. 


Zl en 


nXd 


Tie 




61 


00c 

zzb 










Tin 










^* j 3 










240 




62 


tgg 


— , -.4- 
age 


ggg 


aaa 


cae 


aga 


ggg 


y ta 


t- 1- 1- 
1 1 1 


0 a a 
ddd 


a r^a 
d td 


tat 


a rrrr 

<*yy 


att 


ccc 


caa 


768 


63 


Trp 


Ser 


biy 


Lys 


Tyr 


Arg 


vjiy 


v ai 


irXlc 


Lys 


X 111 


iyr 


Aig 


Tl P 




Gin 




64 




















^ ^ \j 










255 






65 


age 


gac 


gca 


rra 4- 

gac 


4- 4- /-! 

ccc 


dll 


ggg 


y t t 


a a 
ddt 


t d t 


tan 
tdt 


acg 


gee 


age 


gaa 


gta 


816 


66 


Ser 


Asp 


Aia 


Asp 


pne 


Tl £a 

lie 


vjiy 


vai 


Asn 


Tyr 


Tyr 


XIII 




Car 
OC1 


m ii 

V31 Li 


Val 




67 








0 c n 
zb U 










OCR 
Z O D 










270 








68 


agg 


cat 


acc 


tgg 


aac 


cc t 


4- 4- a 

cca 


aaa 


4- 4- r' 

ttt 


t- 1- n 
ttt 


t- 1- 1- 
1 1 1 


gag 


crt cr 

gtg 


aaa 


tta 


y ty 


864 


69 


Arg 


His 


Thr 


Trp 


Asn 


Pro 


Leu 


Lys 


riie 


13 Vl A 


±rllc 


olU 


vdl 




T ,P1 1 
UC Li 


nXd 




70 






O T C 

275 










0 q n 
z o u 










9ft R 
^ 0 j 










71 


gat 


att 


age 


gag 


agg 


aag 


a /-if- 

acc 


caa 


dty 


gga 


tgg 


age 


y 1 1 


fal- 
tdt 


r< r 1 a 
t td 


aaa 


912 


72 


Asp 


lie 


Ser 


CjIU 


Arg 


Lys 


inr 


bin 


Mofr- 
l v lct 


biy 


irp 


Cay 
OCX 


Ua 1 
vdl 


iyr 


Pro 

XT J. ^ 


T A/C! 
XJJr O 




<-> -j 

16 




z y u 










^ 27 3 










300 












74 


gga 


•a 4- -a 

aca 


4- a n 


a 4-rr 

acg 


gee 




aaa 


CLCLCL 


err* t~ 
y 1 1 


ttt 


a rrrr 

o.yy 


tat 


rrrr a 

yyci 


a CJCT 

dyy 


cct 


ctt 


960 


75 


Gly 


lie 


Tyr 


1UIa4- 

Mec 


Til a 

Aia 


Leu 


Lys 


Lys 


Aid 


Ser 


Arg 


iyr 


vjiy 


Aig 


Pro 


Leu 




76 


■J AC 










JlU 










-lie 

J J- J 










320 




1 / 


4- -a 4- 


atL 


acg 


gaa 


aae 


gga 


d t CL 


gcg 


a f <T 
dt y 


ctt 


gat 


gat 


gaa 




aaa 

o.y a. 


y ^-y 


1008 


78 


Tyr 


lie 


inr 




Asn 


vjiy 


Tl 0 

lie 


Ala 
Aid. 


X III. 


Leu 


Asp 




m 11 
01 11 






Val 




7 y 










■IOC 










j j \j 










335 






80 


gaa 


etc 


aca 


acc 


caa 


eae 


etc 


caa 


far 
tdt 


y 1 1 


td t 


ddy 


y t 


ate 


gaa 


gac 


1056 


81 


Glu 


Til-. 

pne 


lie 


lie 


bin 


rilS 


Leu 


V3*1I1 


Tyr 


vdl 


mi 0 




Ala 


lie 


Glu 


Asp 




0 0 
oz 








i a n 










0 *± j 










350 








83 


gg c 


ctg 


gac 


gca 


aga 


ggt 


4- a 0 


4- 4- r< 


td t 


4- rrrr 

i-gg 


4- r«a 

t td 


ttt 


atg 


gat 


aac 


tac 


1104 


84 


Gly 


Leu 


Asp 


Val 


Arg 


Gly 


Tyr 


pne 


Tyr 


Trp 


Ser 


irne 


Mo 4- 

ixie t 


Asp 


Asn 


iyr 




85 






355 










360 










365 










86 


gag 


tgg 


aaa 


gag 


ggg 


ttt 


ggg 


cct 


aga 


ttt 


ggc 


eta 


gtg 


gaa 


gtt 


gat 


1152 


87 


Glu 


Trp 


Lys 


Glu 


Gly 


Phe 


Gly 


Pro 


Arg 


Phe 


Gly 


Leu 


Val 


Glu 


Val 


Asp 




88 




370 










375 










380 












89 


tat 


caa 


acc 


ttc 


gag 


aga 


agg 


ccc 


agg 


aag 


agt 


get 


tac 


gta 


tac 


gga 


1200 


90 


Tyr 


Gin 


Thr 


Phe 


Glu 


Arg 


Arg 


Pro 


Arg 


Lys 


Ser 


Ala 


Tyr 


val 


Tyr 


Gly 




91 


385 










390 










395 










400 




92 


gaa 


att 


gca 


aga 


agt 


aag 


gaa 


ata 


aag 


gat 


gag 


eta 


tta 


aag 


aga 


tat 


1248 


93 


Glu 


He 


Ala 


Arg 


Ser 


Lys 


Glu 


He 


Lys 


Asp 


Glu 


Leu 


Leu 


Lys 


Arg 


Tyr 




94 










405 










410 










415 







OIPE 



PAGE: 3 RAW SEQUENCE LISTING DATE: 08/26/1999 

PATENT APPLICATION US/09/369,735 TIME: 15:33:15 

Input Set: 1369735. RAW 

95 ggc eta cca gaa ctt caa ctt 1269 

96 Gly Leu Pro Glu Leu Gin Leu 

97 420 

98 <210> SEQ ID NO 2 

99 <211> LENGTH: 423 

100 <212> TYPE: PRT 

101 <213> ORGANISM: Pyrococcus horikoshii 

102 <400> SEQUENCE: 2 

103 Met Pro Leu Lys Phe Pro Glu Met Phe Leu Phe Gly Thr Ala Thr Ser 

104 15 10 15 

105 Ser His Gin lie Glu Gly Asn Asn Arg Trp Asn Asp Trp Trp Tyr Tyr 

106 20 25 30 

107 Glu Gin lie Gly Lys Leu Pro Tyr Arg Ser Gly Lys Ala Cys Asn His 

108 35 40 45 

109 Trp Glu Leu Tyr Arg Asp Asp lie Gin Leu Met Thr Ser Leu Gly Tyr 

110 50 55 60 

111 Asn Ala Tyr Arg Phe Ser lie Glu Trp Ser Arg Leu Phe Pro Glu Glu 

112 65 70 75 80 

113 Asn Lys Phe Asn Glu Asp Ala Phe Met Lys Tyr Arg Glu lie lie Asp 

114 85 90 95 

115 Leu Leu Leu Thr Arg Gly lie Thr Pro Leu Val Thr Leu His His Phe 

116 100 105 110 

117 Thr Ser Pro Leu Trp Phe Met Lys Lys Gly Gly Phe Leu Arg Glu Glu 

118 115 120 125 

119 Asn Leu Lys His Trp Glu Lys Tyr lie Glu Lys Val Ala Glu Leu Leu 

120 130 135 140 

121 Glu Lys Val Lys Leu Val Ala Thr Phe Asn Glu Pro Met Val Tyr Val 

122 145 150 155 160 

123 Met Met Gly Tyr Leu Thr Ala Tyr Trp Pro Pro Phe lie Arg Ser Pro 

124 165 170 175 

125 Phe Lys Ala Phe Lys Val Ala Ala Asn Leu Leu Lys Ala His Ala lie 

126 180 185 190 

127 Ala Tyr Glu Leu Leu His Gly Lys Phe Lys Val Gly lie Val Lys Asn 

128 195 200 205 

129 lie Pro lie lie Leu Pro Ala Ser Asp Lys Glu Arg Asp Arg Lys Ala 

130 210 215 220 

131 Ala Glu Lys Ala Asp Asn Leu Phe Asn Trp His Phe Leu Asp Ala lie 

132 225 230 235 240 

133 Trp Ser Gly Lys Tyr Arg Gly Val Phe Lys Thr Tyr Arg lie Pro Gin 

134 245 250 255 

135 Ser Asp Ala Asp Phe lie Gly Val Asn Tyr Tyr Thr Ala Ser Glu Val 

136 260 265 270 

137 Arg His Thr Trp Asn Pro Leu Lys Phe Phe Phe Glu Val Lys Leu Ala 

138 275 280 285 

13 9 Asp lie Ser Glu Arg Lys Thr Gin Met Gly Trp Ser Val Tyr Pro Lys 

140 290 295 300 

141 Gly lie Tyr Met Ala Leu Lys Lys Ala Ser Arg Tyr Gly Arg Pro Leu 

142 305 310 315 320 

143 Tyr lie Thr Glu Asn Gly lie Ala Thr Leu Asp Asp Glu Trp Arg Val 

144 325 330 335 
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145 Glu Phe lie lie Gin His Leu Gin Tyr Val His Lys Ala He Glu Asp 

146 340 345 350 

147 Gly Leu Asp Val Arg Gly Tyr Phe Tyr Trp Ser Phe Met Asp Asn Tyr 

148 355 360 365 

149 Glu Trp Lys Glu Gly Phe Gly Pro Arg Phe Gly Leu Val Glu Val Asp 

150 370 375 380 

151 Tyr Gin Thr Phe Glu Arg Arg Pro Arg Lys Ser Ala Tyr Val Tyr Gly 

152 385 390 395 400 

153 Glu He Ala Arg Ser Lys Glu He Lys Asp Glu Leu Leu Lys Arg Tyr 

154 405 410 415 

155 Gly. Leu Pro Glu Leu Gin Leu 

156 420 

157 <210> SEQ ID NO 3 

158 <211> LENGTH: 57 

159 <212> TYPE: DNA 

160 <213> ORGANISM: Artificial Sequence 

161 <22 0> FEATURE: 

162 <22 3> OTHER INFORMATION: Description of Artificial Sequence: An upper primer designe 

163 create the Ndel site. 

164 <400> SEQUENCE: 3 

165 taagaaggag atatacatat gccgctgaaa ttcccggaaa tgtttctctt tggtacc 57 

166 <210> SEQ ID NO 4 

167 <211> LENGTH: 46 

168 <212> TYPE: DNA 

169 <213> ORGANISM: Artificial Sequence 

170 <220> FEATURE: 

171 <223> OTHER INFORMATION: Description of Artificial Sequence:A lower primer designed 

172 create the BamHI site. 

173 <400> SEQUENCE: 4 

174 tttactgcag agaggatccc taatcctaaa gttgaagttc tggtag 46 
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